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Quantum process tomography is a necessary tool for verifying quantum gates and diagnosing 
faults in architectures and gate design. We show that the standard approach of process tomography 
is grossly inaccurate in the case where the states and measurement operators used to interrogate the 
system are generated by gates that have some systematic error, a situation all but unavoidable in 
any practical setting. These errors in tomography can not be fully corrected through oversampling 
or by performing a larger set of experiments. We present an alternative method for tomography to 
reconstruct an entire library of gates in a self-consistent manner. The essential ingredient is to define 
a likelihood function that assumes nothing about the gates used for preparation and measurement. 
In order to make the resulting optimization tractable we linearize about the target, a reasonable 
approximation when benchmarking a quantum computer as opposed to probing a black-box function. 



INTRODUCTION 

The design and control of scalable quantum architec- 
tures at the level of precision necessary for fault toler- 
ant computation remains a considerable challenge. There 
has been remarkable progress in numerous physical sys- 
tems (e.g. superconducting circuits [IH3], atomic systems 
[HIS], etc.) to improve metrics such as the gate fidelity 
[6], and incrementally these values are approaching the 
fault-tolerant threshold. Diagnosing errors and improv- 
ing designs of gates and architectures however, often re- 
quires much more information than a single scalar value. 
Instead we need a full characterization of a quantum pro- 
cess, and that necessitates quantum process tomography 
(QPT) 0. 

The essential idea of QPT is as follows: prepare an 
initial quantum state, apply the operation we would like 
to reconstruct, measure the expectation value of some 
observable and then repeat for different initial states and 
observables until it is possible to retrodict the process 
through some form of matrix inversion. This scales ex- 
ponentially poorly with the number of qubits, n, since 
the parameters necessary to specify a general map are 
0(2 4 ™), but generally we only need to perform QPT on 
small subsystems (one or two qubits) of a larger archi- 
tecture and can then verify that the subsystems are iso- 
lated through other means [5]. Additionally, due to the 
fact that there will be errors on the measurement out- 
comes, if for no reason other than finite measurement 
statistics, the resulting process may not be physical (i.e. 
a completely positive trace preserving map). When this 
non-physicality arises from Gaussian noise on the mea- 
surement outcomes it can be corrected using maximum 
likelihood estimation or through Bayesian inference tech- 
niques [9UT6]. 

In addition to stochastic measurement noise, quantum 
process tomography should be consistent with respect 
to systematic noise. Most quantum computing architec- 



tures allow only for fixed initial states and measurement 
operators so that the state preparation and measurement 
(SPAM) described for tomography involves the applica- 
tion of gates that may have the same degree of error as 
the process being interrogated. These errors in QPT can 
not be corrected by gathering more measurement statis- 
tics or enforcing physicality alone. Here we show that 
with standard maximum likelihood techniques these er- 
rors, and in particular coherent errors, lead to fidelity 
estimates from QPT that can be very poor. Even more 
problematic is that the ratio of the QPT to SPAM errors 
actually can grow as the SPAM error decreases implying 
that as the gate fidelities approach the threshold it can 
be even harder to estimate them correctly. SPAM er- 
rors can be accounted for in fidelity estimation by using 
randomized benchmarking \T7 20J but this does not 
provide full tomographic information about the gates in 
order to diagnose and correct errors. We observe a dis- 
agreement between benchmarking |20j and tomography 
PQ experiments on similar samples which suggests that 
SPAM errors are a current limitation that needs to be 
addressed. 

In this manuscript we explore techniques to compen- 
sate for SPAM errors while still obtaining a full charac- 
terization of our quantum gates. One simple technique 
is to oversample, not by increasing the repetitions of a 
given experiment, but by increasing the number of gates 
used for SPAM. We show that this technique shows no 
improvement for general unitary errors, however it does 
yield an improvement for a uniform error such as a global 
frame transformation. The primary subject of this work 
is an alternative approach to tomography that places 
the SPAM gates on the same footing as the gate un- 
der investigation, and then retrodicts an entire library 
of unitary gates in a self-consistent manner. The 'cost' 
of this method in terms of the number of independent 
measurements and post-processing is polynomially equiv- 
alent to performing standard QPT on each gate in the 
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library being reconstructed. Our approach assumes very 
little about the gate error model, in contrast to the self- 
correcting state tomography proposed in [21] or the boot- 
strap method in [22j which otherwise compensate for the 
same SPAM issues. 

The remainder of the manuscript proceeds as follows. 
In Section | we give a detailed description of QPT and 
simulate the effects of SPAM errors for a range of er- 
ror models and system parameters. In Section | we show 
the effects of oversampling using unitary 2-designs and in 
section]] we develop self-consistent gate-set tomography. 
Finally, in section | we apply our self-consistent tomogra- 
phy to experimental data obtained from two independent 
experiments on a system consisting of a single supercon- 
ducting qubit. 

THE TROUBLE WITH TOMOGRAPHY 

A general quantum process on a finite d-dimensional 
Hilbert space Hd (d — 2™ for qubit systems) is a com- 
pletely positive trace-preserving (CPTP) map. There 
are many ways to represent such a map, A, but in this 
manuscript we will primarily use two of them: the Choi 
matrix [23j which is given by 

p A = ^E ij ®A(E ij ), (1) 

ij 

where E^ = and the Pauli transfer matrix (PTM) 

[I], which is 

(R A ) ij = ^Tr(P i A(P j )), (2) 

where Pj denotes the set of Pauli matrices, 
{I, X,Y, Z}® n , though one could extend this to qudit 
systems by considering any Hermitian basis for the Pj's. 
The transformation between these two representations 
is the linear mapping, 

{R A )ij =Tr (p A Pf ® P k ) , 

PA=^f£(i*A)«i? ' 0Pfe ' ^ 
ij 

and each representation have useful properties. The 
Choi matrix is positive semidefinite if and only if A is 
completely positive. If we define Pq = Id, then the 
map is trace-preserving if and only if the first row of 
1Z A is the vector (1,0,0,...) and is unital if the first 
column has the same structure. Furthermore, in the 
PTM picture composition becomes matrix multiplica- 
tion, 1Z Ai oa 2 = 7^Ai7^A 2 ! since the PTM is the stan- 
dard superoperator group representation defined over the 
Pauli basis. 

Let us define an experiment by the triple {pj, Mj, rrij} 
which describes preparing the system in state pj measur- 
ing the operator Mj and obtaining an expectation value 



rrij according to 

m^Tr^A^)). (4) 

We can define a vector | p) ) whose elements are ((j\p)) = 
Tr(P j p)/d as well as ({M\j}} = Tr(MPj) (the omission of 
the dimensional factor in the measurement is intentional 
and is a consequence of bounding the values of the 1Z 
matrix between ±1). In this form, 

mj = ((MMp,)) = Tr{\Mj))(( Pj \ T n A ). (5) 

This final expression is simply another inner product of 
the matrix 1Z A with the super-operator \Mj))((pj\. For 
the vectorization of the superoperator 1Z A we use the 
notation r A as the column major vectorized 1Z A and Sj 
as the vectorized \Mj))((pj \ so that 

mj = sJr A . (6) 

We can now express the entire set of experiments as 

m = S T r A , (7) 

where the columns of the rectangular matrix S are the 
vectors Sj. 

The first estimate for our quantum process, rbarej 
arises from linear inversion. While S is typically not 
square we can invert SS T or, in the case of an incom- 
plete set of measurements, find its pseudo inverse. This 
leads to the bare estimate 

r b arc = (SS T y 1 Sm. (8) 

The astute reader may have noticed that at no point 
have we enforced that our estimate i^are correspond to a 
physical map, and we will find that for many error models 
it does not. 

The dominant source of error, in the tomography lit- 
erature at least [9HIB]. is statistical error due to finite 
measurement statistics. Instead of direct access to rrij 
we measure 

m j = s]v A + ^fN j W j , (9) 

where Wj is a Gaussian random variable with mean zero 
and unit variance and Nj is the noise power. Due to 
the central limit theorem we can always approximate 
the error model as Gaussian if we repeat the measure- 
ment sufficiently many times in which case Nj scales as 
l/(# of repetitions of experiment j). The transforma- 
tion m'j — rrij / y/Wj and = Sj / y / Aj simplifies the 
Gaussian likelihood function to 

£(m'|r) = exp ^ £ \m'j s'Jrf j . (10) 

For this error model the most rigorous estimate of the 
gate would be to perform some sort of Bayesian estima- 
tion where rBayes = / r£(m'|r)dr/ J £(m'|r)dr but these 
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integrals are often extremely hard to calculate, especially 
since there is no uniquely good measure over quantum 
channels dr. A much simpler technique is to report the 
maximum of the likelihood function. If the global max- 
imum lies inside the space of physical maps then it is 
simply the rbarc from Eq. Q , and if most of the support 
of £(m'|r) is also physical then the Bayesian estimate 
and the maximum likelihood estimate will be be approx- 
imately the same. In the case where the bare estimate is 
unphysical maximizing £(m'|r) over the space of physi- 
cal r is a semidefinte program [53] and can be solved for 
small instances easily with optimizers such as SeDuMi 
[25] . This is because the likelihood function is of the 
form of a 2-norm distance between vectors m' and S' T r 
and the constraints are that the Choi matrix is positive 
scmidcfinitc. We use the approach outlined in the supple- 
ment of [T] which is an extension of the state tomography 
techniques in [26] [27] as well as [12] . 

While there are issues regarding maximum likelihood 
estimation for QPT, there is a deeper problem that af- 
fects any reconstruction method that utilizes the like- 
lihood function. To calculate the likelihood function we 
require Sj which in turn implies we have a complete char- 
acterization of pj and Mj. In any experimental imple- 
mentation of tomography there will be SPAM errors and 
in many situations the magnitude of the SPAM errors is 
the same order as the size of the gate errors we are trying 
to estimate. If the SPAM errors are stochastic, then the 
situation is reducible to the previous case by effectively 
treating the SPAM errors as additional sampling noise 
on the measurement. When the errors are systematic 
we may write \pj)) TZ £j \pj), \Mj)) TZjr. \Mj) and 
therefore Sj —> TZ$ j ® IZjr.Sj. 

For the remainder of this manuscript we will consider 
a slightly less general form of tomography that is ap- 
plicable to many experimental implementations: a fixed 
initial state \po)) and measurement operator ((Mo| and 
a library of gates G = {7£i ,7^2) ■ ■ ■ TZn}- In this picture 
we describe experiments according to the convention 

rriij = ((MoI^j^a^Ipo)) = ({M^K^pi)). (11) 

Systematic errors are described G( crr ) = 
{R,£ x TZi, 1Zg 2 1Z2, ■ ■ ■ 'R.£ n 'R.n}i under the assump- 
tion that the initial gate-set was composed of unitary 
maps. 

In Fig. [I] we simulate the effects of both systematic 
and stochastic measurement noise in the case where the 
measured gate is a perfect identity gate. This test is a 
good primitive for both theory and experiment since the 
identity is the one gate that should be perfectly implc- 
mentable in any experiment by immediately performing 
measurement after state preparation (i.e. doing nothing 
for no time). In this figure the systematic SPAM error 
comes from a depolarizing channel of varying strengths 
£dcp- We do not impose the CPTP constraint on the out- 
come and therefore measure the difference between the 
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FIG. 1. (color online) Reconstructing a perfect identity gate 
with imperfect tomography due to depolarizing errors on the 
SPAM gates as well as Gaussian random noise on the measure- 
ment outcomes. We plot the diamond norm distance between 
the bare estimate Eq. |8| and the identity versus the noise 
power. In A) we vary the strength of the depolarizing error 
for a fixed gate-set with four elements corresponding to map- 
ping the ground state to the four corners of a tetrahedron. 
The experimental noise power is obtained from [T]. In B) we 
fix the depolarizing strength at £<±c P = 10 _3 and vary the set 
of gates used for SPAM over four sets of gates of differing 
order. 

reconstructed gate and the identity by a diamond norm 
distance [33], which is calculated through the semidefi- 
nite programming technique in 29J. In this simulation 
the gates map the |0) state to the four points on a tetra- 
hedron which is essentially the most symmetric, minimal 
set of gates. We observe that for large noise powers the 
error in the estimate decreases exponentially with respect 
to decreasing noise power (and thus increasing repeti- 
tions) until it hits a floor determined by the systematic 
error in the interrogating gates. 

In Fig. [Tj3 we look at a similar plot for different li- 
braries of unitary SPAM gates. The first two gate-sets 
are defined in terms of the set of states to which they 
map the qubit ground state |0): a tetrahedron or the six 
cardinal directions on the Bloch sphere. The second two 
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FIG. 2. (color online) The ratio of the error in reconstructing 
the identity to the average gate error plotted versus the aver- 
age gate error for a single qubit using a tetrahedral gate-set 
and no stochastic measurement noise. The estimated opera- 
tion is the 'closest' physical map to the bare reconstruction. 
Seven different error models are compared, four coherent er- 
rors (different random unitary maps applied to each gate, a 
single random unitary map applied to each gate, a detuning 
error and over-rotation) as well three incoherent processes 
(amplitude damping, dephasing and depolarizing errors). 



groups are unitary 2-designs: a twelve element subgroup 
of the Clifford group and the entire 24 element Clifford 
group. Both of these groups correspond to the solid rota- 
tions of a cube, with the twelve element subgroup consist- 
ing of 180° rotations about the faces and 120° rotations 
about the corners of the cube. Generically, increasing 
the number of gates and maximizing the entropy of SS T 
speeds convergence to the estimation floor, but the actual 
limit remains unchanged. 

By introducing systematic errors into our interrogating 
gate-set the maximum of the original estimator Eq. Q 
does not yield a faithful reconstruction of the unidentified 
gate. In fact, the resulting estimate can be highly non- 
physical in the absence of any stochastic noise. Semidcf- 
inite programming techniques constrain the estimator to 
the space of physical maps using the covariance matrix 
of the Gaussian likelihood function as a metric which we 
use to minimize the distance between a physical state and 
the unphysical estimate. However, if this likelihood esti- 
mator is incorrect in the absence of any stochastic noise 
then there is no reason to trust the covariance matrix 
metric over any other metric on operator space (e.g. the 
flat metric) . Minimizing the distance over the flat metric 
can be preformed using the methods in [15] , which was 
proven to be optimal. Since there is no gain in using the 
covariance method for our problem we use a flat metric 
in the following simulations due to its ease of calculation. 

In Fig. [2] we look at the error in reconstructing the iden- 
tity, in the absence of stochastic measurement errors, for 
different models of the systematic errors. We find the 
closest physical state to the estimate under a flat metric. 
Since the two gates under comparison are physical we can 



compare them with the gate fidelity, F, or more precisely 
the gate error, 1 — F. It is informative to plot the ratio 
of the reconstruction error to the average SPAM error 
on the individual gates, versus the average SPAM errors. 
There is a stark difference between coherent errors (over- 
rotation, detuning errors etc.)and incoherent error mod- 
els (Tl or T2 processes for example). In the incoherent 
case the error in estimating the identity is proportional 
to the error on the individual gates while for the coherent 
model the ratio of the errors grows exponentially poorly 
as the gate error decreases. 

The dependence of the tomography error on the gate 
errors has dramatic implications for benchmarking quan- 
tum gates with QPT. A quantum computing architecture 
viable for fault-tolerant error correction will require gates 
with error rates in the range of 1CT 3 to 1(T 5 . From Fig. [2] 
we see that the measured error rates for a fault-tolerant 
gate-set will be more like 10~ 2 to 10~ 3 if the dominant 
source of errors are coherent. In fact, measured error 
rates of 10~ 5 are not even present in Fig. [2] and corre- 
spond to physical gate error rates smaller than 10~ 8 . In 
many cases the dominant source of error is T\ or T2 type 
errors, in which case this effect will be minimal, how- 
ever, as coherence times increase coherent errors will very 
likely become a bottleneck. From Fig. [T] we see that for 
the typical noise power in current experiments we expect 
SPAM errors to dominate the estimate if the gate error 
is less than about a percent, which is well above the error 
due solely to Tl or T2. 



QPT WITH OVERSAMPLING 

From the previous section one can observe that to elim- 
inate fully stochastic errors it is sufficient to oversample, 
repeating experiments many times in order to accurately 
measure expectation values. Oversampling is also an in- 
tuitive approach to dealing with SPAM errors if instead of 
increasing repetitions we alternatively increase the num- 
ber of independent experiments (e.g. by increasing the 
size of the gate library G). In this section we show that 
this method does not compensate for general SPAM er- 
rors, however when the gate library is a unitary 2-design 
some uniform SPAM errors are removed. 

Starting from Eq. ([5]) we can derive the bare estimate 
in a slightly different manner that will be more conducive 
to the following discussion. First, note that we can mul- 
tiply both sides by \Mj))((pi\ and sum over i and j to 
obtain 
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FIG. 3. (color online) Simulated error in reconstructing the identity versus average SPAM error in terms of the gate fidelity 
(left) and the diamond norm distance (right). We look at four gate-sets and 3 types of SPAM error. 



In this case the bare estimate is 



n 



(est) 



(13) 



x (X>i'>>(</V|j • 

If there are SPAM errors the term m^- has the form 

rriij = ((M \Tle i n j Tl A Tl£ i Tl i \p )) (14) 
which leads to, 

^i° st) = \M f ))((M f \j ^^J|Mo»«Mo|^ £ .^ 



There are some very interesting consequences of the 
final expression which we shall examine in a few special 
cases. If the SPAM error is independent of the gate, then 
due to our error conventions we get a cancellation of the 



final term in Eq. ( 15 ) yielding, 

>. -1 

n ( T t) = \T / \M j )){{M j 

x K A K £ . 



(16) 



We could derive similar expressions if the constant error 
occurred before the gate or in some combination of pre 
and post-gate error. If in addition the error commutes 
with all of the gates in the library (as is the case with 
depolarizing errors) the result is, for example 



' — ^•dep'^-A'^dcp ■ 



(17) 



independent of the set of gates used for tomography. This 
form of the reconstructed map lends some intuition to 
Fig. [2] where the depolarizing error and other incoherent 
errors lead to a tomography error that had polynomial 
scaling with respect to the error on the gate-set. Qual- 
itatively, errors of this form 'commute' with the recon- 
struction procedure. 

If the error does not commute with all of the gates 
but the library forms a unitary 2-design |30j we can also 
simplify equation Eq. (16). We start by independently 



(15) analyzing the two quantities in parenthesis in Eq. (16 1 



These sums are both twirls over a unitary 2-design, W, 
which have the following generic form 

W(A) = \I)){{I\{{I\A\I))+(I - |/))«7|) MA)^((I\A\I)) 

(18) 

[S]. Therefore, the first factor is given by 



W(|Afo))((Mo|) = |/»((/|((/|M )) 2 

«Mo|M }) - ((/|Af )) 2 



(i-wxcn) 



(19) 



d 2 - 1 
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and the second by 



W(\M ))({M \n £ ) = \I))((I\((I\M ))((M \K £ \I)) 

(( M \n £ \M )) - ((I\M ))((M \K £ \I)) 
d 2 -l 

(20) 



(i-wxai) 



Both of these are proportional to depolarizing channels 
which generically have the form |/))((/| + e(I — |/))((/|). 
The product W(|Af ))((M |)- 1 W(|M ))((Af ^|) is 
therefore proportional to a depolarizing channel with a 
strength 



((M \Ke\M )) 
((M \Ke\I)){{I\M a )) 



- 1 



((Mo | Mo)) 
«/|M )) 2 



1 



and proportionality constant 



((M \n £ \i)) 

((I\Mo)) 



(21) 



(22) 



The bare estimate for a constant error on a gate-set com- 
posed of a unitary 2-design is thus given by 



7?.l est) = aKi dep) KATl £ . 



(23) 



The previous theoretical analysis suggests that there 
may be a fundamental difference when the gate library is 
a unitary 2-design, and this is confirmed in Fig. [3] When 
we measure the errors in terms of the gate fidelity we see 
a clear difference with respect to the unitary 2-designs. 
In that case a global unitary error has the same effect on 
the fidelity as a depolarizing error, as expected from the 
previous argument. QPT errors resulting from indepen- 
dent unitaries on each of the gates remains qualitatively 
the same for all four sets. What we describe as a global 
unitary error is a strange object and should not be con- 
fused for control errors such as over rotation or detuning 
errors, but is instead more akin to a global frame mis- 
alignment. 

Interestingly, the errors look very different when mea- 
sured by the diamond norm (the right and left plots of 
Fig. [3]) . The diamond norm distance of the reconstruc- 
tion scales polynomially with the gate-set diamond norm 
error for all forms of the SPAM error and gate-sets. The 
fact that the rate of the depolarizing error spans twice 
as large a domain as the coherent errors in the diamond 
norm is also curious. These two metrics clearly yield 
very different results and it is an important question as 
to which is appropriate for an experiment to report. For 
this reason, we consider both for the remainder of this 
manuscript. 



is based on the likelihood function, and we have used the 
wrong one. A proper likelihood function should incorpo- 
rate the SPAM errors, but characterizing the SPAM er- 
rors themselves requires QPT and therefore a likelihood 
function. This is an impasse for standard QPT methods. 
The alternative we present in this section is to assume 
nothing about the SPAM errors (though later we will 
bound their magnitude) and instead estimate an entire 
library of gates in a self-consistent manner. 

Let us write the set of experimental gates as G' cxp - ) = 
{1Z £i 1Zi,1Z £2 1Z2, ■ ■ ■ 1Z £n 1Zn} which is a faulty version of 
the library G( idcal ) = {Ki,K 2 , . . .K N }. The free param- 
eters of the this model are the error terms lZ £l while the 
ideal gates are fixed. We begin by performing a series of 
N 3 experiments of the form 

m ijk = {{M Q \n £k n k n £3 n 3 n £i n t \ Pa )). (24) 

which in essence consists of the tomography data neces- 
sary to reconstruct each element of G^ exp - 1 , 

The likelihood of a trial set of gates G = 
{R £l Tli,TZ £2 Tl2, ■ ■ ■ H£ n Hn} is now given by 



C(G) =cxp ( - ^ \mjk 



((M \ii £k n k n £j n 3 n £i ni\p ))\ 



(25) 



The constraints such that the gate library is physical is 
that each of the maps lZ £j are completely positive and 
trace preserving. Finding a maximum likelihood estimate 
is equivalent to minimizing the least square estimator, 



((M \ne k n k K £j Tl j 'R. £i lli\po)) 



LSQ(G) = ^2 m ijk 

i,j,k 

(26) 

but is highly non-trivial due to the fact that this function 
is 6 th order in the gate-set. 

We can reduce the order of the estimator by lineariz- 
ing the evolution about the ideal set of gates. This is 
a reasonable assumption if we are testing components of 
a faulty quantum computer as opposed to probing some 
unknown physics or interaction. In this situation we al- 
low 1Z £ = IZi + £ j where £j is not necessarily physical 
but is small, \\£j\\ <C 1. Expanding the evolution to first 
order in £ we have 



fhijk =((M \TZ k TZ J TZ l + £ k TikTljTli 

+ TLk^TljUi + KkR>AKi\po)), 



(27) 



SELF-CONSISTENT TOMOGRAPHY 

We have thus far failed at tomography in this 
manuscript for the simple reason that our methodology 



or by transforming back to the physical lZ £j 

fhijk =((Af | - 2n k njn i + K £h n k n j n i 



(28) 
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FIG. 4. (color online) A comparison of standard QPT and the self-consistent method presented in this manuscript for individual 
random unitary errors for the set of gates mapping to the six cardinal directions. We plot both the diamond norm and fidelity 
error between the estimates and the actual gates (averaged over the set of gates) plotted versus the equivalent distance of the 
actual set from the ideal gates. We show the magnitude of the tomography error and the ratio of the error with respect to the 
actual gate error for both cases. 
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FIG. 5. (color online) A comparison of least-square estimator 
(negative log-likelihood function) for standard QPT and the 
linearized likelihood estimation method with SPAM errors are 
generated by random unitary maps of each gate. On the x- 
axis is the estimator of the ideal gate-set in the absence of 
errors with respect to the measurement outcomes and on the 
y-axis the ratio of the estimated gate-set's log likelihood to 
the ideal. 



Therefore self-consistent reconstruction of a set of gates 
can be described as the following least square estimation 

min ^\m ij k + ((M \2?R. k K J Tli-'Rc h 1l k 7l j Tli 

|2 



(29) 



subject to the constraints that the error maps IZg are 
all physical. 

Minimizing this linearized least-square fit under the 
physicality constraint is another example of a semidcf- 
inite program. In fact, it is isomorphic to the orig- 
inal QPT problem on a larger space where TZ^ — > 
Rs 1 © Rs 2 © . . . R,£ N . There is however one major dif- 
ference which is that the optimal fit is no longer unique, 
which implies that the matrix corresponding to SS T in 



this space is not full-rank. As an example take any 
unitary transformation G < - ldca1 ' = {TZi, 7Z2, ■ ■ ■ TZn} — > 
G (orr) = {TZ^TZ 1 TZ u ,n]}n 2 n u ,...n^TZ N TZ u } where 
TIu\po))((Mq\7£u = \po}}((M \. The measurement out- 
comes are invariant to such a transformation since 

m ijk =Tv(|po>>((Mo|7^ftfe) 

^Tr(\p ))((M \K%Tl i K u KZK j Tl u 'R%nkn l 



^(KulpoMMolKlKilljKK, 



m ijk 



(30) 



When we prepare and measure in the computational ba- 
sis this is equivalent to a frame invariance with respect to 
a diagonal unitary transformation. This is not analogous 
to moving to the interaction picture since the resulting 
frame is stationary, and does not generate an inertial cor- 
rection to the Hamiltonian. It is more akin to an energy 
rescaling. Such a frame is physically irrelevant and in 
the following simulations we calculate distance between 
gate-sets by optimizing to find the most generous frame. 

In Fig. [4] we compare the results of standard QPT and 
our self-consistent approach. The error model is that 
of individual random unitary errors, the uncorrectable 
error from the previous section, and the gate library is 
the six element gate-set that maps |0) to the cardinal 
directions on the Bloch sphere. We plot the errors from 
both the fidelity and the diamond norm of the estimates 
versus the error of the physical gate-set with respect to 
the ideal target. For each we show the average result for 
the six gates. Additionally, we also plot the ratio of the 
error in the estimate to the ideal for both measures. 

In both cases there appears to be an exponential im- 
provement in accuracy over standard QPT with respect 
to decreasing gate error. This is fundamentally different 
from the results of the previous section where the fidelity 
error changed substantially but the diamond norm re- 
mained qualitatively the same, and so this method is not 
an effective depolarization of the error. 

To clarify the differences between the two approaches 
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we plot the least square estimator Eq. ( 25 1 for the two es- 



timates in Fig. [5] Again we look at the ratio of the value 
of the least square fit for the estimated value divided by 
the fit of the ideal gates to the data. The estimate from 
QPT is flat with respect to lowering gate error which cor- 
responds to the fact that QPT is minimizing the wrong 
quantity. Our estimator does not properly minimize this 
quantity either, due to the linear approximation, but the 
scaling is dramatically more favourable. 

There are some obvious shortcomings of this method. 
We have increased the number of experiments and the 
complexity of the reconstruction algorithm. These in- 
creases are both polynomial in the dimension of the sys- 
tem, but since the dimension is already exponential in the 
number of qubits the resulting overhead is non-trivial. In 
principle, we can apply these techniques to two or three 
qubit systems. However, the reconstructions in Fig. [4] 
are already taxing the default numerical precision of our 
semidefinite optimizer. If we extended the plot in Fig. [5] 
to the left we would very quickly see the self-consistent 
method flatten from the numerical tolerance. Also, the 
maximization of diagonal frames is computationally ex- 
pensive, generally more so than the SDP solver itself. 



EXPERIMENT 

We apply our self consistent tomography method to 
data generated from two independent experiments with 
single junction transmon (SJT) qubits. Sample A is a 
SJT capacitively coupled to a coplanar waveguide res- 
onator (used also in reference [3] as sample b), while 
sample B is mounted in a cavity machined from bulk 
copper. In both cases the qubits are driven by resonant 
microwaves that are shaped in order to perform optimal 
qubit gates. For these experiments we use the gate-set 
{J, X w , X n n, 5^/2} where the notation Rg denotes a ro- 
tation about the axis R of angle 9. See references [TJ [3T] 
for the design and construction of these systems. 

In sample A(B) the qubit frequency was uj q /2-K = 
4. 7610(4. 4738)GHz and was coupled to a resonator of fre- 
quency Lo r /2n = 7.4269(12. 118)GHz. In sample A there 
was a second qubit coupled to the resonator with fre- 
quency uj q /2n = 5.3401 that played no role in the fol- 
lowing experiment since the two qubits were only weakly 
coupled. The coherence times of the two samples were 
Ti = 8.1(44.0)jus and T 2 * = 16.2(7.2)/is. The samples 
are radiation shielded and cooled to 15mK in a dilution 
refrigerator. 

In TableUwe show the results of both estimation meth- 
ods as compared to the desired gate. In both cases we 
find that QPT overestimates the error in both the dia- 
mond norm and the gate fidelity as compared to the self- 
consistent method. For sample A the error on the QPT 
was roughly 1.5 times the self-consistent error while for 
sample B the it was roughly 3 times as large. We show 



plots of the Pauli transfer matrices for both estimates 
and the ideal maps in Fig. [6] for sample B. 

Furthermore, since the gates in our library were gen- 
erators of the Clifford group we were able to perform 
randomized benchmarking on the two samples using the 
methods in [181 119| . The fidelity error from randomized 
benchmarking was even smaller than the error from the 
self-consistent reconstruction, dramatically so in the case 
of sample B. We conjecture that this is due to errors in 
the measurement operator Mq, since the calibration of 
the measurement voltage to the |0) and |1) states has 
been observed to shift over the course of long experi- 
mental runs. Randomized benchmarking is insensitive to 
this type of error while the methods presented in this 
manuscript are not. Accounting for errors in the initial 
state pq and the fixed observable Mq are both important 
future avenues to extend this method, as well as that of 
dealing with slowly time-varying errors. 



CONCLUSION 

The methods of tomography are very powerful when 
using a well-characterized system to probe a black-box 
operation, but this is rarely the case when verifying quan- 
tum computing hardware. Instead, the situation is one in 
which all components are faulty, and in that scenario we 
have shown that process tomography can fail dramati- 
cally. Instead of separating the preparation and mea- 
surement phases of tomography we instead must view 
the experiment as a sequence of faulty gates, and while 
we have not discussed errors on the fiducial state and 
fixed measurement operator, those too should probably 
be brought into question in the future. Without incor- 
porating these errors into our model it is impossible to 
obtain a more accurate reconstruction through statistical 
analysis alone. 

In this manuscript we have developed a new protocol 
that self-consistently reconstructs a library of quantum 
gates by modifying the likelihood function of quantum 
process tomography to incorporate our uncertainty of the 
state preparation and measurement gates. In simulation, 
this self-consistent method outperforms standard QPT 
in terms of the accuracy of estimation since QPT con- 
sistently underestimates gate fidelities in the presence of 
SPAM errors. This method requires the same number of 
experiments as QPT and only adds polynomial overhead 
to the amount of classical post-processing. In fact, in 
any case where tomography has been performed on the 
entire set of gates used for state preparation and mea- 
surement no further experiments are needed to apply our 
protocol. Colloquially, we have taken to calling this the 
'overkill' method of tomography, since the number of the 
experiments borders on the ridiculous for all but one or 
two qubit systems, but this will be the case for any pro- 
tocol that provides full information about a library of 
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Sample 


Gate 


1 - -F 9 (AqPT, Aideal) 


1 - F 9 (A S c, Aid C ai) 


1 - F a (RB) 


| Aqpt — Aidcal| 


||AsC — Ajdeal||o 


A 


/ 


0.0058 


0.0051 


- 


0.037 


0.029 


A 




0.013 


0.0098 


- 


0.075 


0.042 


A 


A^r/2 


0.0077 


0.0047 


- 


0.049 


0.018 


A 




0.0096 


0.0053 


- 


0.069 


0.068 


A 


(U) 


0.0090 


0.0062 


0.0029 


0.057 


0.039 


B 


I 


0.11 


0.037 


- 


0.43 


0.13 


B 




0.039 


0.0057 




0.31 


0.041 


B 


Xir/2 


0.040 


0.014 




0.37 


0.090 


B 


K/2 


0.032 


0.021 




0.32 


0.090 


B 




0.057 


0.020 


0.0016 


0.36 


0.087 



TABLE I. The gate error from QPT and self-consistent tomography for A) a SJT capacitively coupled to a coplanar waveguide 
and B) a SJT suspended in a copper cavity. We report the fidelity error and diamond norm for both approaches (QPT and 
self-consistent tomography), and for each gate in the set. The last row in each sample, labeled (U), is the average of the 
previous four as well as the average error obtained from randomized benchmarking. 




Input Pauli Operator Input Pauli Operator Input Pauli Operator Input Pauli Operator 



FIG. 6. (color online) The 7?.-matrices for the ideal gates and the reconstruction from QPT and self-consistent tomography for 
each of the four measured gates on sample B. 

maps. Self-consistently reconstructing a library of gates, Thomas Ohki and discussions with R. Blume-Kohout and 

as opposed to performing tomography on the individual Graeme Smith. We acknowledge support from IARPA 

members, yields more trustworthy and generally higher under contract W911NF-10-1-0324. All statements of 

fidelity estimates in both experiment and simulation and fact, opinion or conclusions contained herein are those 

we expect the difference between standard QPT and the of the authors and should not be construed as represent- 

self-consistent method presented here to grow more pro- ing the official views or policies of the U.S. Government, 
nounced as we reach lower and lower gate errors. 
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